1. PREFACE


A major concern of NASA's Technology Transfer Division
has been to determine the most effective methods for
matching applications of NASA's technology with the needs
and interests of non-NASA technologists and scientists. A
recurrent theme in the work performed for NASA at Stanford
University has been to investigate those methods which allow
maximum user involvement in the selection of information
deemed relevant to his concerns. This is viewed as
important, since it is held that the ultimate "innovator,"
or "man on the bench" is the best judge of what is of use to
him. Thus, minimization of unnecessary pre-screening and
pre-selection by third parties better allows the ultimate
user to exercise his own judgement, and to make more
effective use of the information presented to him.

At the same time, however, it is clear that eliminating
or reducing pre-selection of information leads directly to
higher user costs, because the amount of information each
user must screen is magnified enormously.

Using machine readable data bases with an interactive
searching algorithm can be a means for getting information
directly to those who will use it, with minimal increase in
the user's costs. As an instrument of technology transfer,
NASA's RECON files could conceivably play an even greater
role in exposing the public to available NASA technology.
With this idea as background, this paper investigates the
data base industry, and the functions that NASA already
performs as a member of that industry.


2. INTRODUCTION 


The growth of machine readable data bases has been rapid
over the last fifteen years. Currently over 1000
computerized data bases provide information in all areas of
the natural and social sciences, arts and humanities, and
business and public policy. A report from International
Resource Development estimates revenue from the supply and
distribution of on-line data bases at $1.25 billion in 1981,
with growth to $5.5 billion in 1991.¹

This paper describes a sample of the machine readable
data bases available to the technologist and researcher in
the natural sciences and engineering, and compares them with
the data bases and data base services offered by NASA.

The data base industry can be segmented into three
categories, following the categorization of Roger
Christian.² Christian attempted to distinguish three sectors
of this industry: the publishers, distributors, and users.
The problem with this simple segmentation is that one
individual or firm may operate in more than one of these
sectors. Distributors publish their own data bases.
Publishers market to academic and special libraries, as well
as to commercial vendors (who in turn market to academic and
special libraries). Libraries that buy a data base may
compete directly with the supplying vendor or publisher.
The government sells, buys, and distributes to, from, and in
competition with private vendors and publishers. Finally, a
publisher's own data base and printed services may compete
with each other.

¹ Journal, March 23, 1981

² Christian, Roger: The Electronic Library: Bibliographic
Data Bases 1975-76. Knowledge Industry Publications: White
Plains, New York: 1975; page 4

However, despite the often complex flow of goods in this
industry, categorizing the data base industry into users,
publishers, and distributors is used in this study as a
useful classification of the industry.

NASA performs the functions of all three sectors in the
data base industry. NASA publishes its own data bases,
called the RECON/NASA files. NASA not only compiles much of
these files, but publishes them in machine readable form,
and distributes them to its user group through the
Industrial Applications Centers (IAC's), or the State
Technology Applications Centers (STAC's), or by direct
dial-in access to NASA's computer center in Maryland.
Finally, NASA is itself a subset of the entire user set.
Much of the information on the RECON/NASA files is primarily
of use within NASA, and the files are widely used by
in-house researchers and scientists.

Non-NASA users have access to the RECON files either
through the IAC's and STAC's, or through an arrangement
whereby prime contractors with NASA may have direct on-line
access to the RECON files.


3. FUNCTIONS ASSUMED BY NASA IN THE DATA BASE INDUSTRY

In the machine readable data base industry, NASA not only
serves the functions of publishing, distributing, and using
its own data bases, but was responsible for developing
RECON. RECON is an interactive index and text searching
system, which was the precursor to Lockheed's current DIALOG
system. Through RECON, users can search several types of
data files. These files are listed in Table 1, which gives
the number of entries as of July 1, 1980.

Approximately 70% of the RECON entries are comprised by
STAR and IAA. Entries in STAR include: NASA, NASA
contractor, and NASA grantee reports; reports of other
government agencies, universities, private firms, and
domestic and foreign institutions; translated reports; NASA
owned patents and patent applications; and dissertations and
theses. STAR covers all aspects of aeronautics and space
research and development; related basic and applied
research; and applications including earth resources, energy
development, conservation, oceanography, environmental
protection, urban transportation, and topics of national
interest.

Entries in IAA include: periodicals (including government
sponsored journals); books; meeting papers and conference
proceedings of professional and academic societies; and
translations of journals and journal articles. The subject
matter is aeronautics and space science and technology.


                          TABLE 1

             DATA FILES ON THE RECON DATA BASE
                    As of July 1, 1980


Document Files

Name      Description                   Number of    Percentage
                                        Entries      of Total

STAR      Scientific and Technical
          Aerospace Reports               454,000       29.4%
IAA       International Aerospace
          Abstracts                       609,000       39.6%
OSTARE    "Old" STAR                      148,000        9.6%
LSTAR     Limited STAR                     81,000        5.3%
CSTAR     Confidential STAR               144,000        9.4%
Others    Tech Briefs, some ASRDI         104,000        6.7%

Total                                   1,540,000      100.0%


Special Files

Name                Description                 Number of Entries

CPA                 Computer Program Abstracts         2,300
R&DCS               NASA Contract Directory           14,600
RTOPS               Research and Technology
                    Operating Plan                     7,000
ASRDI (Fire)        Safety File - Fire                 4,000
ASRDI (Cryo)        Safety File - Cryogenics           7,000
ASRDI (Mech/Struc)  Safety File - Mechanical
                    and Structural                       900
Tech Briefs         Tech Briefs                        8,300


NALNET (NASA Library Network)

Subject        Availability        Number

Books          NASA Holdings        74,000
Books          MARC Tapes          270,000
Periodicals                          7,600

Other files on the RECON system include OSTARE, which is
similar to STAR, lists unclassified documents, and is older
than STAR. The R&DCS, or NASA Contract Directory, provides
the contract number, technical monitor, center, and
principal investigator for NASA contracts. The RTOP file
lists current and older RTOP's, which are program plans
between NASA Headquarters and the NASA Field Centers. The
collection of ASRDI files is referred to as the "Safety
File," and lists reports collected by the Lewis Research
Center addressing safety issues in each of the listed areas.
Finally, the Tech Briefs file lists NASA technology
available for commercialization, and contains most of the
information in the print version of Tech Briefs. The
hardcopy Tech Briefs has traditionally served as a major
vehicle for transferring information concerning NASA
technology to the private sector, and as a result there seems
to be greater public exposure to the hardcopy Tech Briefs
than to the machine readable version.

Some of these data files, in particular the ASRDI,
NALNET, RTOP's, and R&DCS files, may be of little interest
outside the community of NASA personnel and associated
contractors. However, other files, such as STAR, IAA, CPA,
and the Tech Briefs may be of considerable utility to
engineers and technologists outside this community.

Historically, there has been public (non-NASA) access to
these files through either of two avenues. Those members of
the public who are not prime contractors with NASA may
submit a search request with a regional IAC or STAC, and the
NASA staff will perform the search. This branch of the
Technology Transfer Program offers machine and manual
searching, historical as well as Selective Dissemination of
Information (SDI), or Current Awareness, searching, and the
support staff to interpret and analyze the results of the
search.

Prime contractors with NASA may obtain on-line use of the
RECON files under a development program to allow greater
numbers of users dial-up access to the RECON system. In the
past, the number of direct access dial-up ports that RECON's
computer could handle has been severely limited. A new
front-end processor and larger computer will greatly enhance
capacity. As an example of the advances made in expanding
its capacity and response, it is reported that RECON
response time is down to about two to two and one-half
seconds, as opposed to the approximately 26 second average
wait in the early stages of the system.³

Although much of the file content of NASA's RECON may
only be directly pertinent to those closely connected with
NASA, there is a substantial amount of information that may
be useful outside NASA. Spinoff 1979 makes the point that
through the IAC's and STAC's, a non-NASA inquirer has access
to over 10 million documents, and of these, 1.5 million are
in NASA/RECON. In all, approximately 15,000 scientific and
technical journals worldwide are covered, as are
publications from other government agencies.⁴

³ Figures based on a conversation with Mr. Bill Brown of
NASA.

Thus, NASA's data base answers technical and reference
questions for non-NASA users and NASA users and contractors,
and also serves as an internal research directory for the
community of NASA employees and contractors. In terms of
technology transfer, NASA uses machine readable data bases
to transmit information to private sector technologists
through two avenues. One avenue has been mediated through
the IAC's and STAC's, and the other is direct and
conditional on the prerequisite of being a NASA prime
contractor.

One question which can be raised immediately, in light of
NASA's plans of expanding RECON capacity and access, is: to
what extent have system limitations led NASA to treat the
RECON files as a "private resource?" Limited access to
RECON through IAC's and STAC's is an efficient
distributional procedure when hardware limitations prevent
large numbers of dial-up ports. If a "peak load" of
independent users cannot be met satisfactorily by a
constrained system, then it is obviously more efficient to
force all system demands to be spread out over time. A
reasonable mechanism for doing this is to channel all, or
almost all system demand through agents who will meet each
particular demand sequentially. It appears that the IAC's
and STAC's have performed this role quite effectively. As
the capacity of the RECON system is expanded, it may still be
worth emphasizing the other services offered by NASA
Technology Transfer agents, as the importance of their role
as intermediaries to RECON can be diminished.

⁴ Spinoff 1979. NASA's Office of Space and Terrestrial
Applications; page 112. Figures are for 1979.

This matter will be dealt with again, following a
discussion of a sampling of the machine readable data bases
available to private sector technologists in the non-NASA
sector.


4. A SAMPLE OF PUBLICLY AVAILABLE MACHINE READABLE DATA
BASES


A sample of data bases was made to compare the scope of
coverage and services offered in the machine readable data
base industry with that offered through NASA's RECON. The
sample was limited to technical and scientific data bases,
as these were assumed to be of greatest interest to private
sector scientists and technologists. The comparison was
made on the basis of subjects covered, type of information
recorded in the data base, source of information recorded in
the data base, number of years covered, quantity of
information and annual additions to the data base,
approximate yearly charges for acquisition or lease of the
data base, and associated services.

The sample comprised 42 data bases, and was drawn from a
master list of scientific and technical data bases. The
master list was assembled from Information Market Place
1978-1979, which was selected as principal reference
document. Information Market Place is a relatively current
and complete international compendium of information sources
that are publicly available. Appendix A is a copy of the
Table of Contents of Information Market Place, which is
included to illustrate the document's scope of coverage.


In selecting the information sources to be included in
their document, the editors of Information Market Place
followed a narrow definition of what constitutes information
products and services. Their definition reads as follows:

     Emphasis is on those organizations which, by the
     application of advanced technologies, create and
     gather information and add value by performing one
     or more of the following operations: organizing
     it, rearranging it, adding other facts, making it
     more available, or by converting it into a new
     medium.⁵

Guided by this definition, the editors claim to have
undertaken extensive research to locate all agencies and
organizations involved in providing information products and
services internationally. Upon identifying these
organizations, the editors mailed them questionnaires
inquiring into their principal and related products and
services. Most of the descriptions in the final directory
appear to have been self-selected. Apparently, the
questionnaire was composed of a series of descriptions, and
lists of pre-selected subject areas which were merely to be
checked off by the respondent. Certain phrases and
combinations of words were repeated quite frequently,
particularly in describing the subject areas of the data
bases. It is presumed that this is indeed how the
information was collected, and this raises the question of
how well the descriptions fit the actual content and
services of the data bases and data base companies.



On one hand, this procedure made all respondents look at
the same questionnaire with the same questions and
categories, and therefore lends a degree of uniformity to
the final descriptions of data bases and data base services.
This should aid in the comparability of these descriptions
in Information Market Place.

On the other hand, with this procedure's reliance on
self-reporting, there is the possibility that each
respondent interpreted the questionnaire subjectively,
although the degree to which one can impose subjective
interpretation to words which are generally well defined
through industry-wide use appears minimal. It is more
likely that variation in the quality of reported information
could occur because of varying individual perceptions
concerning the expected utility and likely return to
contributing to this directory. One can never be sure to
what extent respondents presented an exaggerated image of
their data base and data base services while perhaps trying
to take advantage from some free advertising, or failed to
adequately describe their data base and services because
they thought this endeavor one not likely to pay off. One
can only speculate on the extent of these effects, if they
exist at all.

However, with a high voluntary participation rate among
American data base publishers, and with certain indications
that data base services were insufficiently described, it
appears that data base publishers perceived the directory to
be in their own interest, and that these publishers did not
exaggerate the extent of their data base coverage and data
base services.

If a data base publisher did not return their
questionnaire, the editors of the Information Market Place
directory performed their own research and presented only
the results of this research in the directory, with a note
to that effect. Thus, it was possible to measure the
participation rate among publishers who had been contacted
by Information Market Place. For American companies this
rate was 100%, and for all companies internationally the
rate was 67%. As only American data base companies were
included in the sample discussed in this paper, one can
surmise that American companies saw this directory to be in
their own interest, and that non-response of the publishers
is not a problem for the sample discussed here.

Evidence that the publishers did not exaggerate the
extent of their data base and data base services can be
observed in the reported services that publishers provide in
association with their data bases. For example, it was
reported that 54% of the data base publishers offered
"machine searching" as a service, whereas only 12% offered
"retrospective searching." It seems illogical that there
would in fact be such a high discrepancy, especially since
most on-line searches access several years of data. It
seems more likely that the services actually performed by
data base publishers were somewhat under-reported, or that
there was ambiguity in the descriptive labels used in the
questionnaire. In either case, the sample discussed in this
paper will present only an incomplete picture of any single
publisher, although the overall picture across all
publishers should be adequate.


4.1 THE SAMPLE

There is a section of Information Market Place that lists
machine readable data bases which are publicly available
worldwide. A "data base" is defined in this directory as a
"collection of machine readable records which are
periodically updated and which can be processed on computers
with the appropriate software."⁶ The editors of Information
Market Place described approximately 406 data bases in this
section, and these included technical and non-technical,
domestic and foreign data bases.


⁶ Information Market Place, page 36

The sample of technical and scientific data bases was
collected in two steps. First, all the technical and
scientific domestically produced data bases were selected.
This was done by a process whereby the author made two
passes through the list. In the first pass the author
eliminated all data bases whose descriptions did not include
areas related to the natural sciences and traditional
technical fields. A second pass was made to provide a
higher degree of objectivity to this process. This second
pass took advantage of a subject index of data bases which
is included in Information Market Place. Scientific and
technical subject areas were selected, and the data bases
listed under each subject heading were cross-checked against
the first selection to ensure that there were no omissions.
Appendix B contains a list of the subject areas checked.
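
The cross-check in the second pass amounts to a set comparison. The following Python sketch illustrates the idea under stated assumptions: the function name, the subject headings, and the assignment of data bases to headings are all hypothetical, and are not taken from Information Market Place or from the appendices.

```python
def cross_check(first_pass, subject_index, selected_subjects):
    """Return data bases found via the subject index but missing from the first pass."""
    second_pass = set()
    for subject in selected_subjects:
        # Collect every data base listed under a selected subject heading.
        second_pass.update(subject_index.get(subject, ()))
    return second_pass - set(first_pass)


# Hypothetical toy data; subject assignments are illustrative only.
first_pass = {"Agricola", "Biosis Previews", "CA Condensates"}
subject_index = {
    "Agriculture": ["Agricola"],
    "Biology": ["Biosis Previews"],
    "Chemistry": ["CA Condensates", "CA Subject Index"],
    "Finance": ["Hypothetical Finance File"],
}
selected_subjects = ["Agriculture", "Biology", "Chemistry"]

# Data bases the first pass overlooked, then the merged master list.
omissions = cross_check(first_pass, subject_index, selected_subjects)
master_list = sorted(first_pass | omissions)

print(omissions)
print(master_list)
```

In this toy case the subject index surfaces one data base that the first pass missed, and headings outside the selected scientific and technical areas contribute nothing to the master list.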

This procedure resulted in a list of 128 domestically
produced data bases that could be of direct interest to a
private sector scientist or technologist. Of course, it is
always possible that certain data bases, such as the
financial data bases, or indices of industrial production,
or even the listings of historic homes, could be of direct
interest to these scientists and technologists. It is even
more likely that certain foreign data bases, such as the
European Space Agency's Space Components File, or the
Carbon-13 Nuclear Magnetic Resonance CMR, or Hydromechanics
and Hydraulic Engineering, would be of direct interest.
This is an important point, since some foreign data bases,
such as Aquatic Sciences and Fisheries, are available
on-line through services such as Lockheed's DIALOG.
However, as specified before, non-technical or foreign data
bases were omitted. An expanded study should attempt to
identify other types of data bases which could be relevant,
and which play a significant role in the data base industry,
as perceived from the American viewpoint.

The working sample was selected from this list of 128
alphabetically listed data bases by taking every third
listed data base. There was an early oversight in that the
Paris-based Aquatic Sciences and Fisheries was originally
included in the working sample. This was replaced by the
first entry on the master list, which was the API Tech
Index.
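
Taking every third entry of an ordered list is a systematic sample. The Python sketch below shows one way the count of 42 can arise from a list of 128. It assumes that counting began so that the 3rd, 6th, ..., 126th entries were taken; the text does not state the starting point, but starting at the first or second entry would yield 43 entries rather than 42. The list contents, the function name, and the position of the replaced entry are placeholders.

```python
# Stand-in for the alphabetical master list of 128 data bases.
master_list = [f"data base {i:03d}" for i in range(1, 129)]

# Assumption: the 3rd, 6th, ..., 126th entries are selected.
working_sample = master_list[2::3]
print(len(working_sample))


def replace_entry(sample, unwanted, replacement):
    """Swap one unwanted entry for a replacement, keeping the sample size fixed."""
    return [replacement if item == unwanted else item for item in sample]


# Hypothetical position for the foreign entry that was sampled by oversight;
# it is replaced by the first entry on the master list.
foreign_entry = working_sample[3]
final_sample = replace_entry(working_sample, foreign_entry, master_list[0])
print(len(final_sample))
```

Under this starting point the first entry on the master list is never sampled, so it remains available as the replacement.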

Summary information on each data base was then collected
from Information Market Place, and this information was
supplemented by data from certain annual reports, the
publication Computer-Readable Data Bases: A Directory and
Data Sourcebook, and from Excerpts from Directory of On-Line
Data Bases. The basic information collected for each data
base included its name, publisher, type of coverage (whether
each data entry is bibliographic or non-bibliographic, with
a description), fields of coverage, years covered, number of
entries as of 1978 and number of annual additions, and where
the data base is available if available on-line. Appendix C
lists the names of the data bases in the sample, and the
publisher of each data base.

Entry 30 in Appendix C is RECON/NASA, which is listed as
available through the Knowledge Availability Center,
University of Pittsburgh. The RECON/NASA data base is
described as bibliographic (listing citations), with
multidisciplinary coverage of engineering, science, and
project management. It covers the years since 1963, had
720,000 entries in 1978 with 55,000 annual additions and is
not listed as being available on-line to the public.


The description of the RECON/NASA files in Information
Market Place differs from that given in Spinoff
because the Information Market Place description is only for
the Knowledge Availability Center (the Pittsburgh "IAC").
Each IAC essentially has its own procedure for using the
RECON system, with the New England Research Applications
Center (NERAC) even producing its own tapes. The
description of RECON in Information Market Place is in fact
a description of information services the Pittsburgh IAC
offers, rather than a complete description of the NASA/RECON
system.


This discrepancy is of interest, because it appears that
the extent of services available through the IAC's and
STAC's, as well as the scope of RECON/NASA, are both being
under-reported in a reference document such as Information
Market Place. With this insight into the manner in which
data bases were described in Information Market Place, the
characteristics of the data bases selected in the sample can
be discussed. The results will be examined to assess the
nature of publicly available scientific and technical data
bases.


4.2 RESULTS AND DISCUSSION


Information Market Place classified the data bases as
either bibliographic or non-bibliographic. A bibliographic
data base was intended to supply full bibliographic
information of a published document. A non-bibliographic
data base supplied specific data, component specifications,
descriptions, or non-published reports. Information Market
Place did not supply detailed definitions on the difference
between these two types, but an article by Doszkocs, Rapp,
and Schoolman discussed the types of available data bases in
greater detail.⁷

⁷ Doszkocs, Rapp, Schoolman, "Automated Information
Retrieval in Science and Technology," Science, April 4,
1980, pages 25-30

Their discussion of bibliographic data bases, data banks,
and the emerging "knowledge" data bases helps clarify the
classifications used by Information Market Place. These
authors cited bibliographic data bases as those referencing
published literature, and which are used most often to
locate an article or document. These tend to be
computerized versions of existing indexing and abstracting
services. Examples include Engineering Index, Science
Citation Index, and Government Reports Announcements Index.
Non-bibliographic data bases include what these authors
refer to as "data banks" and "knowledge bases." Data banks
contain numeric and analytical data obtained from published
literature, and often reference the source of information.
Examples include the National Library of Medicine's
Registry of Toxic Effects of Chemical Substances and
Toxicology Data Bank; and the Laboratory Animal Data Bank.
The former two contain toxicological, chemical, and
pharmacological data for approximately 36,000 substances
(listed by all their names and synonyms, and including their
formula), and the latter provides husbandry conditions and
physiological and pathological baseline data for laboratory
animal groups, and allows interactive statistical analysis.
Data from the former two data bases comes from published
literature, whereas data for the latter is obtained directly
from participating laboratories. Finally the authors
discuss the "knowledge data bases," which they compare to
encyclopedias or textbooks. These data bases, such as the
Hepatitis Knowledge Base, represent an analysis and
synthesis of available knowledge. These examples help
illustrate the difference between bibliographic and
non-bibliographic data bases.

When the data bases in the sample were analyzed for the
relative number of bibliographic and non-bibliographic data
bases, the classification of Information Market Place was
used. This classification was generally in agreement with
that of Doszkocs, Rapp, and Schoolman. 21 of the data bases
in the sample were described as bibliographic and 21 were
not. Although the numbers need not be exact due to certain
definitional problems, there does seem to be an even split
between bibliographic and non-bibliographic data bases.

The data bases were then cross-tabulated by bibliographic
versus non-bibliographic coverage and multiple versus single
field coverage. A data base was defined as having multiple
field coverage if its subject area included more than one
separate field, even if those fields are related. A data
base with single field coverage had, obviously, a subject
area in only one distinct field.
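
The two-way classification rule can be stated compactly. The Python sketch below is one reading of it, offered as an illustration rather than as the procedure actually used: the function name and the record layout are hypothetical, and each data base is assumed to come with a flag for bibliographic coverage and a list of its subject fields.

```python
def coverage_class(is_bibliographic, fields):
    """Classify one data base for the cross tabulation.

    Related fields still count as separate fields, so any listing with
    more than one distinct field is treated as multiple field coverage.
    """
    kind = "bibliographic" if is_bibliographic else "non-bibliographic"
    breadth = "multiple field" if len(set(fields)) > 1 else "single field"
    return kind, breadth


# Illustrative inputs; the field lists are abbreviated examples.
print(coverage_class(True, ["marine transportation", "pollution", "law"]))
print(coverage_class(True, ["cancer research"]))
print(coverage_class(False, ["discontinued thyristors"]))
```

Each data base then falls into exactly one of four cells, which is what makes the cross tabulation possible.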

As an illustration of this cross-tabulated classification
scheme, consider the following examples: Maritime Research
Information Service (MRIS) counted as a bibliographic data
base with multiple field coverage, since Information Market
Place listed it as "bibliographic," indexing technical
reports, journals, and conference papers. Its listing of
multiple fields included marine transportation, pollution,
business, chemistry, law, metallurgy, and twelve other
categories. Cancerproj counted as a bibliographic data base
with single field coverage, as it is described as a
bibliographic listing of current cancer research projects,
and its coverage is limited to researchers, organizations,
and funding sources involved in cancer research, and to
descriptions of that research.

A non-bibliographic data base with multiple field
coverage was the Total Marketing Analysis Research Service,
which is a "full text database" listing contract awards by
the Department of Defense, and which covers Aerospace and
Aeronautical Engineering, Agriculture and Agricultural
Engineering, Biology, Business, Economics and Management,
and Electronics and Electrical Engineering. Any of the
Cordura Publications, Inc. data bases may serve as an
example of a non-bibliographic data base with single field
coverage. For example, one of these data bases is called
Discontinued Thyristor, which provides engineering and
purchasing information on discontinued models of this
particular electronic component. Its field of coverage is
discontinued thyristors exclusively, providing performance
specifications, engineering data, type numbers, and
manufacturers.

The results of this cross tabulation are shown in Table 2.
Note that the data bases are split evenly between those with
multiple versus single field coverage, as well as those that
are bibliographic versus non-bibliographic.

                         TABLE 2

                DISTRIBUTION OF DATA BASES
    Cross Tabulation: Bibliographic vs. Non-Bibliographic,
          By Multiple vs. Single Field Coverage


Coverage          Bibliographic    Non-Bibliographic

Multiple Field         17                  4
Single Field            4                 17

Again, although the numbers may not be exact due to
definitional problems, there does seem to be a tendency for
bibliographic data bases to cover multiple subject fields,
and for non-bibliographic data bases to cover a single
subject field.
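
The marginal totals and the strength of this tendency can be read off the four cells of Table 2. A short Python sketch of that arithmetic follows. It assumes the single field, non-bibliographic cell is 17, the value implied by the 21 non-bibliographic data bases in the sample, and the variable names are illustrative.

```python
# Cell counts from Table 2, keyed by (field coverage, type of data base).
# Assumption: the single field / non-bibliographic cell is 17.
table = {
    ("multiple field", "bibliographic"): 17,
    ("multiple field", "non-bibliographic"): 4,
    ("single field", "bibliographic"): 4,
    ("single field", "non-bibliographic"): 17,
}

row_totals = {}
col_totals = {}
for (coverage, kind), count in table.items():
    row_totals[coverage] = row_totals.get(coverage, 0) + count
    col_totals[kind] = col_totals.get(kind, 0) + count

total = sum(table.values())

# Data bases that follow the observed tendency: bibliographic with multiple
# fields, or non-bibliographic with a single field.
on_pattern = (table[("multiple field", "bibliographic")]
              + table[("single field", "non-bibliographic")])
pattern_share = on_pattern / total

print(row_totals)
print(col_totals)
print(f"{on_pattern} of {total} data bases ({pattern_share:.0%}) follow the pattern")
```

Both margins split 21 to 21, while 34 of the 42 data bases, about 81 percent, fall in the two cells that match the tendency described above.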

Perhaps this result is not altogether surprising. One
would expect a non-bibliographic data base to serve a well
defined (i.e., single subject) field, since by nature it is
oriented towards this. Somewhat more surprising is the
finding that most bibliographic data bases cover multiple
subject fields, as there is no a priori reason for this to
be so. One possible explanation is that bibliographic data
bases may serve general research or literature reviewing
purposes. For these purposes, a bibliographic data base
covering a single subject field could be too narrow to be
practical.

As an illustration of this problem it is interesting that
the Stanford University Engineering Library's principal
librarian found that performing an interdisciplinary search
on Lockheed's DIALOG system posed a major difficulty. The
problem was that too many of the system files (individual
data bases) had to be referenced to cover all the relevant
material and subject areas desired. In this example, it
could be said that the DIALOG system data bases still do not
cover enough fields. Enhancements to the DIALOG system
currently enable users to search a small set of files
simultaneously, and this is the first step towards the
ability to perform simultaneous interdisciplinary searches
on multiple data bases.

This question of the content and structure of a data base
could be relevant to NASA, particularly if NASA were to
consider aggressively tailoring RECON files to users' needs,
as more users have the opportunity for direct dial-up access
to RECON. The first step in this process would be to
identify those potential users and their needs, and
determine if NASA's files are of appropriate content and
structure. Of course, as long as only NASA prime
contractors have this access, there is a high probability
that no file modifications would be necessary. Most NASA
prime contractors and NASA personnel are, after all, in
closely related areas of work.

A footnote to this discussion is that seven of the
seventeen non-bibliographic single field data bases were
produced by Cordura Publications, Inc. These data bases
were: Discontinued Diode, Discontinued Thyristor,
Elastomers, Interface Integrated Circuits, Linear Integrated
Circuits, Microwave Tubes, and Optoelectronics. All of
these provided specific technical and production information
in electronics and materials. These appeared the most
specifically targeted data bases in the sample. Judging
from Cordura's 1978 and 1979 Annual Reports and the Value
Line Survey, these data bases have been quite successful for
Cordura and are the company's most profitable and promising
enterprises. Cordura's data bases are an example of
targeted computerized information banks, which are valued
enough by scientists and technicians that they are
commercially successful.

The source of the information listed in each data base is
described in Table 3. Table 3 was derived from the
publication, Computer-Readable Data Bases: A Directory and
Data Sourcebook, and data was available for 22 of the data
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TABLE 3 


SOURCE OF INFORMATION LISTED IN DATA BASE
BY DATA BASE, IN PERCENTAGES

Column headings:
  (a) Journal Articles
  (b) Gov't. Reports
  (c) Patents
  (d) Monographs, Conference Proc., Theses
  (e) Reprints, Conference Papers
  (f) Manufacturer's Catalogs
  (g) Press Rpts., Broadcasts, Releases
  (h) Other

Data Base                                   (a)   (b)   (c)   (d)   (e)   (f)   (g)   (h)

 1. API Tech. Index                          93     1     0     1     5     0     0     0
 2. Agricola                                 90     0     0    10     0     0     0     0
 3. Biosis Previews                        94.8    .2     0     5     0     0     0     0
 4. CA Condensates                           72     2    16    10     0     0     0     0
 5. CA Subject Index                         72     2    16    10     0     0     0     0
 6. Cancerproj                                0     0     0     0     0     0     0   100
 7. Chemical Abstracts Service
    Source Index                              0     0     0     0     0     0     0   100
 8. Chemical Industry News                  100     0     0     0     0     0     0     0
 9. Clinprot                                  0     0     0     0     0     0     0   100
10. Conference Papers Index                   0     0     0     0     0     0     0   100
11. Defense Market Measures System            0     0     0     0     0     0     0   100
12. Drug Product Information File          [nine values appear in the source, column
                                            assignment unclear: 1 98 2 0 0 0 0 0 0]
13. Energy Line                              44    42     0     3     9     0     1     1
14. Food and Agricultural Chemistry          74     1    15    10     0     0     0     0
19. Polymer Science and Technology           43     1    50     1     0     0     0     0
20. Science Citation Index                  100     0     0     0     0     0     0     0
21. TTD Keyterm Index                        54     1    30     4     1     4     0     6
22. Transdex                                  0   100     0     0     0     0     0     0

[Rows 15-18 are not legible in the source. The surviving values
read: 0, 20, 6, G, 0; 1 2 2 0 0 3; 1 1 1 0 0 7; 0 0 0 0 0 100.]


bases. Of the data bases listed, only Bell and Howell's
Transdex was comprised of a higher percentage of government
reports than was NASA/RECON. 100% of the Transdex entries
were listed as government reports, whereas for NASA/RECON
this figure was 90%. EnergyLine was a distant third in
this ranking, with 42% of its entries coming from government
reports.

The high percentage of government reports listed in
NASA/RECON set RECON apart from the remaining data bases.
82% of these data bases had [figure illegible]% or less of
government reports, 50% had 65% or more of journal articles,
and 27% had 100% "other." Thus, NASA is in a distinct
minority in terms of its high percentage of government
reports. Whether this is perceived by potential users to be
an advantage or disadvantage is unanswered, but is obviously
important in terms of "marketing" the RECON system.

Table 4 shows the distribution of data bases insofar as
the first year of coverage is concerned. No clear trend by
five year grouping is apparent, although there does seem to
be a growth by decade (10 prior to 1960, 13 in the 1960's,
and 17 in the 1970's).

Table 5 illustrates the distribution of data bases by
their number of entries in 1978. The largest data base in
the sample was the CA (Chemical Abstracts) Subject Index
Alert, with 17,896,000 entries. Other large data bases were
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TABLE 4 


DISTRIBUTION OF DATA BASES
By First Year of Coverage


First Year of Coverage          Number of Data Bases

Prior to 1960                            10
1960-1964                                 7
1965-1969                                 6
1970-1974                                11
1975-1979                                 6


the Science Citation Index with 5 million entries, and CA
Condensates with 133^ 600.


TABLE 5

DISTRIBUTION OF DATA BASES
By Number of Entries, 1978


Number of Entries               Number of Data Bases

Less than 100,000                        22
100,000-500,000                          10
500,000-1,000,000                         4
1,000,000+                                6


The distribution of the 22 smaller data bases (those with
100,000 entries or less) is exhibited in Table 6.
Eighteen of these twenty-two smaller data bases had less
than 50,000 entries. Figure One plots the number of data
bases on a log scale of size. The distribution is





TABLE 6

DISTRIBUTION OF DATA BASES
By Number of Entries, 1978
Less Than 100,000 Entries


Number of Entries               Number of Data Bases

Less than 10,000                          7
10,000-20,000                             6
20,000-50,000                             5
50,000-100,000                            4



essentially constant, at a level of approximately six data
bases for each size grouping. It should be noted, however,
that most of the data bases with less than 10,000 entries
were the highly-specialized, non-bibliographic Cordura
Publications Inc. data bases, as well as the extremely small
ARPANET Requests for Comments (which is essentially an
on-line suggestion box).
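
The arithmetic behind these statements can be checked with a
short sketch. The counts are transcribed from Tables 5 and 6;
the function and variable names are illustrative only and do
not come from the report. Note that the groupings are the ones
published in the tables, which are not equal-width intervals on
a log scale.

```python
# Counts transcribed from Tables 5 and 6 of this report (entries as of 1978).
TABLE_6 = {
    "Less than 10,000": 7,
    "10,000-20,000": 6,
    "20,000-50,000": 5,
    "50,000-100,000": 4,
}
TABLE_5_LARGER = {
    "100,000-500,000": 10,
    "500,000-1,000,000": 4,
    "1,000,000+": 6,
}


def summarize_size_groupings(small, larger):
    """Combine both tables into one set of size groupings and total them."""
    groupings = {**small, **larger}
    total = sum(groupings.values())
    return {
        # Should match the "Less than 100,000" row of Table 5.
        "small_total": sum(small.values()),
        # Data bases with fewer than 50,000 entries.
        "under_50000": sum(
            count for label, count in small.items()
            if label != "50,000-100,000"
        ),
        "all_data_bases": total,
        "mean_per_grouping": total / len(groupings),
    }


if __name__ == "__main__":
    summary = summarize_size_groupings(TABLE_6, TABLE_5_LARGER)
    for name, value in summary.items():
        print(f"{name}: {value}")
```

The totals agree with the text: 22 smaller data bases, 18 of
them under 50,000 entries, 42 data bases in all, and a mean of
six data bases across the seven groupings.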

Table 7 provides the distribution of data bases with
respect to the number of annual additional entries:


TABLE 7

DISTRIBUTION OF DATA BASES
By Number of Annual Additional Entries


Number of Annual Additional Entries     Number of Data Bases

Less than 50,000                                 24
50,000-100,000                                    6
100,000+                                          4





This distribution is represented in Figure Two, which plots
the number of data bases on a log scale of the number of
annual additional entries. In contrast to the plot of
number of entries (Figure One), the plot of number of annual
additional entries does not appear constant. The data base
with the largest number of annual additions was the CA
Subject Index Alert, with 2,600,000 additional entries
annually. CA Subject Index Alert was also the largest data
base in size as of 1978. Data was not available for every
data base.

Data representing the consumer's price to acquire, lease,
or license each data base was collected from Computer-
Readable Data Bases: A Directory and Data Sourcebook. This
data was used to try to estimate a first order relation
between the "price" charged for each data base and a data
base characteristic that in theory would contribute to the
cost of producing the data base. In this paper the size of
the data base and the number of annual additions to the data
base were selected as characteristics hypothesized to most
likely affect the price charged to data base users.

The desired result is to test whether price charged for a
data base equals the marginal cost of producing that data
base. The marginal cost of producing the data base is
approximated by a simple cost function with a single data
base characteristic as its argument. One would prefer time


Figure Two

Distribution of Data Bases
By Number of Annual Additional Entries
(vertical axis: number of data bases; horizontal axis: log
scale of number of annual additional entries)




series data for each data base to perform this measure, but
only a cross sectional sample for one year is available.
Furthermore, the "price" variable is not an equivalent
measure for each data base. The figures reported were for
either acquisition, lease, or license of the data base. For
simplicity the acquisition price was treated equivalently to
one year's lease or license price, even though an acquired
data base would provide additional years of service at no
additional yearly charge, but additional years of service
from a leased or licensed data base would incur such
charges. Finally, only the user's "fixed" costs of gaining
initial access to a data base's information were considered.
Most of the leased or licensed data bases charged an
additional hourly search charge, or "variable" cost to the
user. These additional costs were not considered.

The first set of regressions were to determine if there
were a relationship between the size of the data base and
the price charged the user. Simple regressions were run for
the seventeen data bases for which information was
available, and then for data bases classified either as
private, or non-profit and government. F tests on each
regression showed that none of them were significant at the
90% confidence level. Results are below:

     y = ax + b

     y = acquisition, lease, or license costs
     x = number of data base entries


All data bases:

     (1)  y = .0023x + 3630
          N = 17
          F(1,15) = 3.466    insignificant

Private data bases:

     (2)  y = .0019x + 6031
          N = 11
          F(1,9) = 1.316     insignificant

Government and non-profit data bases:

     (3)  y = -.0007x + 1450
          N = 6
          F(1,4) = .676      insignificant
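
For readers who wish to see how figures of this kind are
produced, the following is a minimal sketch of a simple
regression of the form y = ax + b together with its F
statistic on (1, N-2) degrees of freedom. It is not the
computation used for this report: the function name and the
sample figures are hypothetical, and the underlying price data
is not reproduced here.

```python
def simple_regression(x, y):
    """Least-squares fit of y = a*x + b.

    Returns (a, b, f_stat), where f_stat is the F statistic for the
    slope with 1 and n-2 degrees of freedom. A perfect fit (zero
    residual) raises ZeroDivisionError, since F is then undefined.
    """
    n = len(x)
    if n != len(y) or n < 3:
        raise ValueError("need at least three paired observations")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    syy = sum((yi - mean_y) ** 2 for yi in y)
    if sxx == 0:
        raise ValueError("x values must not all be equal")
    a = sxy / sxx                      # slope
    b = mean_y - a * mean_x            # intercept
    explained = a * sxy                # regression sum of squares
    residual = syy - explained         # residual sum of squares
    f_stat = explained / (residual / (n - 2))
    return a, b, f_stat


if __name__ == "__main__":
    # Hypothetical figures, for illustration only (not the sample data).
    entries = [8_000, 15_000, 40_000, 90_000, 350_000, 1_200_000]
    prices = [2_500, 6_000, 1_800, 7_500, 4_000, 9_000]
    a, b, f_stat = simple_regression(entries, prices)
    print(f"y = {a:.4f}x + {b:.0f}, N = {len(entries)}, "
          f"F(1,{len(entries) - 2}) = {f_stat:.3f}")
```

The resulting F value would then be compared with the tabulated
critical value for the chosen confidence level and degrees of
freedom.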

Regressions were run with the number of annual additions
to the data base as the independent variable, to see if this
variable might not be a proxy for marginal costs, and
produce a correlation with the data base price charged.
There were insufficient data points to do this for data
bases produced by government and non-profit organizations,
so the regressions were only performed on all 13 data bases
for which there was data, and for the 11 private ones. All
regressions were insignificant at the 90% confidence level.

     y = ax + b

     y = acquisition, lease, or license costs
     x = number of annual additions to data bases


All data bases:

     (4)  y = .0186x + 4322
          N = 13
          F(1,11) = 1.482    insignificant

Private data bases:

     (5)  y = .0171x + 5033
          N = 11
          F(1,9) = .9962     insignificant

The insignificance of these regressions may be attributed
to any of the following reasons:

1. Misspecification of variables

2. Errors in the data

3. Misspecification of the model

4. Misspecification of the functional form

5. Lack of any true relationship

Possible misspecification of the variables and errors in
the data have already been discussed, and a correction for
possible heteroskedasticity (problems with correlating
variables from data bases of different sizes) yielded no
better results. The simple regressions used here may not
reflect all the costing and pricing complexities that
actually occur, indicating an inaccurate model. The
functional form of these relationships may not be linear,
but an experiment with log forms did not improve results.
Finally, it is possible that there is no true relationship
between these variables, although the argument for this on
the basis of current evidence is weak. If there were to be
no true relationship, however, economic questions would
arise concerning the extent of potential inefficiencies in
the still young machine readable data base industry.
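
The text does not say which log form was tried. As one
possibility, the sketch below fits a log-log (constant
elasticity) form, price = c * size^e, by regressing log(price)
on log(size). The function names and the sample figures are
hypothetical and are not taken from the report.

```python
import math


def fit_line(x, y):
    """Least-squares slope and intercept for y = slope*x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def fit_log_log(sizes, prices):
    """Fit price = c * size**e; returns (e, c).

    All sizes and prices must be positive, since logarithms are taken.
    """
    log_sizes = [math.log(s) for s in sizes]
    log_prices = [math.log(p) for p in prices]
    elasticity, log_c = fit_line(log_sizes, log_prices)
    return elasticity, math.exp(log_c)


if __name__ == "__main__":
    # Hypothetical figures, for illustration only.
    sizes = [8_000, 15_000, 40_000, 90_000, 350_000, 1_200_000]
    prices = [2_500, 6_000, 1_800, 7_500, 4_000, 9_000]
    e, c = fit_log_log(sizes, prices)
    print(f"price = {c:.1f} * size^{e:.3f}")
```

An exponent e near zero would mean that price hardly varies
with size, which is the kind of outcome an insignificant
regression suggests.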

Twenty of the forty-two data bases were listed as being
publicly available on-line, through services such as
Lockheed's DIALOG. The most commonly cited on-line service
was Lockheed's, but the following were also listed: System
Development Corporation's (SDC's) ORBIT, the National
Library of Medicine's MEDLINE (MEDLARS on-line), the
Bibliographic Retrieval Service (BRS), Triangle Universities
Computation Center, General Electric Mark III, and Drilling
Activity Analysis System. It would be fallacious to assume
that the twenty-two data bases for which no on-line vendor
was listed are not available in some on-line capacity. For
example, Cordura Publications, Inc. offers on-line service
to its data bases through a contractual agreement. Although
this type of service is quite different from the "data base
supermarket" service offered by Lockheed, SDC, or BRS, it
is still an on-line service.

It may be that some data bases can still be accessed only
by batch methods. Further investigation would be required
to determine the exact percentage of data bases available
on-line. However, it is instructive that in the Doszkocs,
Rapp, and Schoolman article cited earlier, the authors note
that the majority of the 528 bibliographic data bases they
are familiar with can be searched on-line.* The authors make
no mention of the proportion of non-bibliographic data bases
available on-line, but one would expect this proportion to
be high due to the nature of these data bases and their use.


* See Appendix D

* Doszkocs, Rapp, and Schoolman, ibid., page 25


5. DATA BASE PUBLISHERS


Information on the publishers of the data bases in the
sample was obtained by referencing the "Database Publishers"
section of Information Market Place. Twenty-six publishers
were responsible for the forty-two sample data bases, and
their names are listed in alphabetical order in Appendix E.
Information was collected for each publisher, and this
information included the total number of machine readable
data bases and associated print products produced, and the
availability of special training sessions, additional
publications, and customized services.

The twenty-first publisher listed in Appendix E is the
NASA-IAC/Knowledge Availability Center. It is described as
publishing two machine readable data bases, and no printed
associates to these data bases. Special services include
seminars and workshops by arrangement through the marketing
department, machine and manual searching, selective
dissemination of information (SDI, or "current awareness"
searching), analytical reports, and technical specialists to
clarify, summarize, and analyze the results of a search.
This list can already be seen to be incomplete, for
Information Market Place does not describe NASA as offering
retrospective searching. "Retrospective searching" seems to
have been a standard category in Information Market Place's
questionnaire, so it should have been listed. This service
is described as a NASA service in Soinoff 1979.*

Of the twenty-six data base publishers, there were
private profit and non-profit organizations, governmental
agencies, and the United Nations. Not enough was known
about the financing of these publishers to determine their
relative numbers, but it seemed that the private
organizations were the majority.

Table 8 illustrates the distribution of data base
publishers by the number of machine readable data bases they
produce:


TABLE 8

DISTRIBUTION OF DATA BASE PUBLISHERS
By Number of Machine Readable Data Bases Published


Number of Data Bases Published     Number of Publishers

One                                         7
Two                                         5
Three                                       8
Four - Nine                                 3
Ten or More                                 3


The three publishers producing more than ten computerized
data bases are Cordura Publications Inc. with 26, Chemical
Abstracts Service with [figure illegible], and the National
Library of Medicine with 12. Twenty of the twenty-six data
base publishers were responsible for three or fewer data
bases. Thus, the industry seems to be characterized by a
large number of publishers who produce a small number of
data bases, and by a small collection of publishers that
produce a large number of data bases. Appendix F discusses
a possible economic model to analyze this finding, and
raises issues for further investigation.


* Soinoff, ibid., page 112
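
The concentration described here follows directly from Table
8, as the sketch below shows. The counts are transcribed from
the table; the function and variable names are illustrative
and not part of the report.

```python
# Rows transcribed from Table 8: (data bases published, number of publishers).
TABLE_8 = [
    ("One", 7),
    ("Two", 5),
    ("Three", 8),
    ("Four - Nine", 3),
    ("Ten or More", 3),
]


def publisher_shares(rows, small_labels=("One", "Two", "Three")):
    """Return (total publishers, publishers with three or fewer, their share)."""
    total = sum(count for _, count in rows)
    small = sum(count for label, count in rows if label in small_labels)
    return total, small, small / total


if __name__ == "__main__":
    total, small, share = publisher_shares(TABLE_8)
    print(f"{small} of {total} publishers ({share:.0%}) "
          "produce three or fewer data bases")
```

Twenty of the twenty-six publishers, roughly three quarters,
fall in the three-or-fewer groups, while only three publishers
produce ten or more data bases.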

The final table in this survey of data base publishers is
a summary of data base related activities undertaken by the
publishers. Table 9 lists the number of data base
publishers that engage in the activity or service listed
(numbers include the NASA facility). Those entries marked
by an asterisk indicate NASA-IAC/Knowledge Availability
Center activities, as reported in Information Market Place.

The first comment to be made on these results is that no
definitions for the listed categories were provided. Thus,
there may not necessarily be a clear distinction between
certain of the categories. Furthermore, it is questionable
that these descriptions are complete. As discussed
previously, this data appears to have been completely
self-reported by the respondents. Thus, there is reason to
question the consistency of the responses. It is likely
that if these activities were well defined, and if one
interviewer had evaluated the activity of each publisher,
then the numbers in some categories would be higher. For
example, if 14 publishers engage in "machine searching" of
data bases, it would seem likely that more than three engage
in "retrospective searches."
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TABLE 9 


NUMBER OF DATA BASE PUBLISHERS
Engaging in Listed Activity


Activity                                  Number of Data Base Publishers

  Publishes Print Products                             20
    (often a paper version of the
    machine readable data base)
  Training Programs                                     9
* Seminars and Workshops                               14
  User's Guide                                          6
  Newsletter                                            4
* Machine Searching                                    14
* Manual Searching                                      8
  Retrospective Searching                               3
  Selective Dissemination of Information                9
  Document Delivery                                     5
  Facsimile Service                                     2
* Analytical Reports                                    5
  Indexing                                             10
  Telephone Interviewing                                4
  Thesaurus for Indexing                                2
  On Line Document/Hardcopy                             1
* Access to Technical Specialists                       5
  Technical Conference Support                          1


However, despite the potential problems cited above, one
can note three items of interest. First, in providing
"technical specialists to analyze and sum results, and to
clarify questions," NASA is one of only five publishers
listed that offer this type of customized service.

Second, only the ARPANET Network Information Center
provided on-line document/hardcopy reproduction. Eight
publishers (not including NASA) provide document delivery,
such as the Institute for Scientific Information. Except
for these efforts, however, on-line access to documents
remains a rare service.

Third, the NASA IAC/Knowledge Availability Center and the
Petroleum Information Corporation were the only publishers
to offer both analytical reports and access to technical
specialists. This indicates that NASA's coupling of
analysis by technical specialists with its data base is a
relatively unique service in the data base industry.


6. DISTRIBUTORS OF MACHINE READABLE DATA BASES

This study was not oriented towards the analysis of
machine readable data base distributors, except insofar as
previous discussions have addressed this section of the data
base industry. Historically, the IAC's have been the access
point to RECON/NASA for all non-NASA users. With dial-up
access to RECON being provided for more and more users, the
role of the IAC's could change. It is important to clarify
who will have dial-up access to RECON (it may never be
universal), and to what extent this will change the nature
and extent of the IAC's "clientele."

A parallel question is what the effects on the IAC's role
would be if a machine readable version of the Tech Briefs
were offered through a distributional vendor such as
Lockheed, SDC, or BRS. An analysis of these effects is
conditional, of course, on the extent to which one believes
there would be a commercial market for this information. If
there were little market potential, then no commercial
vendor would offer such a file through their system (barring
subsidies).

Although it is not the intent of this study to
investigate future strategies for the IAC's, it does seem
important to recognize that certain developments in the
access to RECON/NASA's information could impact the role
played by these institutions.
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7. CONCLUSIONS 

1. Technical and scientific data bases cover a wide
   range of material through a wide variety of formats,
   scope, and sizes. Of note is the finding that
   bibliographic data bases tend to cover multiple
   subject areas, whereas non-bibliographic ones tend to
   have single field coverage.

2. RECON/NASA contains some information of minimal use
   to those outside NASA. However, for that information
   that is of non-NASA interest, NASA data bases are
   much like other scientific and technical data bases
   in their structure, but historically have been much
   different in terms of their access.

3. Despite some user aggravation in certain types of
   searches, the structure of data bases seems tailored
   to user needs. Ongoing refinements and the
   development of new types of data bases should even
   more carefully tailor data bases to user needs.

4. The data base industry has a few large publishers of
   many data bases, and many small publishers. This
   observation carries with it as yet unanswered
   questions.

5. In terms of the activities of data base publishers,
   NASA seems well specialized in its provision of
   technical staff for analytical and consultative
   purposes.

6. With increasing dial-up access to RECON/NASA, there
   may be reason for re-evaluating the role of the
   IAC's. If this is done, it is important to emphasize
   their current specialization in analytical and
   consultative services.


Appendix A


Source: Information Market Place, 1978-79


CONTENTS

PREFACE .............................................. vii

I    INFORMATION PRODUCTION (1-3)
     (1)  Database Publishers ........................ 1
     (2)  Machine Readable Databases ................. 36
     (3)  Print Products ............................. 63
          Databases & Print Products,
          Classified by Subject ...................... 102

II   INFORMATION DISTRIBUTION (4)
     (4)  Online Vendors, Library Networks &
          Telecommunication Networks ................. 121

III  INFORMATION RETAILING (5-6)
     (5)  Information Collection & Analysis Centers .. 133
     (6)  Information Brokers ........................ 147
          Information Retailing,
          Classified by Subject ...................... 138
          Information Retailing,
          Classified by Services ..................... 168

IV   SUPPORT SERVICES & SUPPLIERS (7-9)
     (7)  Terminal Manufacturers ..................... 174
     (8)  Consultants & Other Support Services ....... 177
          Consultants & Other Support Services,
          Classified by Services ..................... 180
     (9)  Foreign Representatives .................... 180

V    ASSOCIATIONS & GOVERNMENT AGENCIES (10-11)
     (10) Associations ............................... 184
     (11) Government & International Agencies ........ 188

VI   CONFERENCES & COURSES (12-13)
     (12) Calendar ................................... 190
     (13) Courses .................................... 193

VII  SOURCES OF INFORMATION (14-15)
     (14) Reference Books ............................ 196
     (15) Periodicals & Newsletters .................. 199

GEOGRAPHIC INDEX ..................................... 203

NAMES & NUMBERS ...................................... 221



Appendix B


Subject area headings from Information Market Place,
1978-1979, used to select data bases of direct relevance to a
private sector scientist or technologist:


Aerospace and Aeronautical Engineering
Agriculture and Agricultural Engineering
Astronomy
Biology
Chemistry and Chemical Engineering
Civil Engineering
Computers, Data Processing Systems
Current Research Projects
Earth and Space
Electronics and Electrical Engineering
Energy
Engineering
Environment
Food Science
General Science and Technology
Geology
Life Sciences
Mathematics
Mechanical Engineering
Medicine
Metallurgy
Nuclear Science
Patents
Petroleum
Physics
Pollution
Technology


Appendix C


The data bases selected for the sample studied in this
report, and their publishers, are:


     DATA BASE NAME                      DATA BASE PUBLISHER

 1.  API Tech Index                      American Petroleum Institute, CAIS
 2.  ASTM Infrared Data Base             Sadtler Research Laboratories, Inc.
 3.  Agricola                            National Agricultural Library
 4.  Biosis Previews                     BioSciences Information Service
                                         (BIOSIS)
 5.  CA Condensates (CA Con)             Chemical Abstracts Service
 6.  CA Subject Index                    Chemical Abstracts Service
 7.  Cancerproj                          International Cancer Research Data
                                         Bank; Smithsonian Science
                                         Information Exchange
 8.  Chemical Abstracts Service          Chemical Abstracts Service
     Source Index
 9.  Chemical Industry News (CIN)        Chemical Abstracts Service
10.  Clinprot                            International Cancer Research Data
                                         Bank Program (ICRDB)
11.  Computer and Information            Cambridge Scientific Abstracts, Inc.
     Systems Abstracts; Electronics
     and Communications Abstracts
12.  Conference Papers Index (CPI)       [publisher not legible in source]
13.  Defense Market Measures System      Frost and Sullivan, Inc.
14.  Discontinued Diode                  Cordura Publications, Inc.
15.  Discontinued Thyristor              Cordura Publications, Inc.
16.  Drug Product Information File       American Society of Hospital
                                         Pharmacists
17.  Elastomers                          Cordura Publications, Inc.
18.  Energy Conservation                 Energy and Environment Response
                                         Center
19.  EnergyLine                          Environment Information Center
                                         (EIC), Inc.
20.  Environmental Impacts               Energy and Environment Response
                                         Center
21.  Food and Agricultural Chemistry     Chemical Abstracts Service
22.  Geological Reference File           American Geological Institute
     (GeoRef)
23.  Hydrological Information Storage    Biological and Agricultural
     and Retrieval System (HISARS)       Engineering
24.  IRIS Infrared Information System    Sadtler Research Laboratories, Inc.
25.  Interface Integrated Circuits       Cordura Publications, Inc.
26.  Linear Integrated Circuits          Cordura Publications, Inc.
27.  Maritime Research Information       MRIS
     Service
28.  Medical Subject Headings (MeSH)     National Library of Medicine
     Vocabulary File
29.  Microwave Tubes                     Cordura Publications, Inc.
30.  NASA RECON                          NASA IAC/Knowledge Availability
                                         Center
31.  Optoelectronics                     Cordura Publications, Inc.
32.  Pharmaceutical News Index (PNI)     Data Courier, Inc.
33.  Polymer Science and Technology      Chemical Abstracts Service
34.  Production Standard Format          Petroleum Information Corporation
35.  Requests for Comments (RFC's)       ARPANET Network Information Center
36.  Science Citation Index (SCI)        Institute for Scientific Information
37.  Small Business Data File            International Data Corporation
38.  TTD Keyterm Index                   Institute of Textile Technology
39.  Total Marketing Analysis            DMS Inc.
     Research Service
40.  Transdex                            Bell and Howell Micro Photo Division
41.  Well History Control System         Petroleum Information Corporation
     (WHCS)
42.  World Energy Supplies System        United Nations Statistical Office
     (Worldenergy)


Appendix D


DIALOG DATABASES - NUMERICAL LISTING

[Reproduced listing of DIALOG file numbers, data base names,
and coverage dates; not legible in this copy.]

*Trademark Reg. U.S. Pat. & Trademark Office.

Source: Guide to Dialog Searching, Lockheed Corporation


Appendix E


The publishers of the sample data bases, listed
alphabetically:

 1. American Geological Institute

 2. American Petroleum Institute

 3. American Society of Hospital Pharmacists

 4. ARPANET Network Information Center

 5. Bell and Howell Micro Photo Division

 6. Biological and Agricultural Engineering

 7. Biosciences Information Service (BIOSIS)

 8. Cambridge Scientific Abstracts, Inc.

 9. Chemical Abstracts Service

10. Cordura Publications, Inc.

11. DMS, Inc.

12. Data Courier Inc.

13. Energy and Environment Response Center

14. Energy Information Center

15. Frost and Sullivan Inc.

16. Institute for Scientific Information

17. Institute of Textile Technology

18. International Cancer Research Data Bank Program

19. International Data Corporation

20. Maritime Research Information Service

21. NASA IAC/Knowledge Availability Center

22. National Agricultural Library

23. National Library of Medicine

24. Petroleum Information Corporation

25. Sadtler Research Laboratories Inc.

26. United Nations Statistical Office


Appendix F


A Th«or«tioal Diseusaion oi tha Data Basa Inciustry 

Struotura 

An attempt was made to perform some theoretical analysis of
the data base industry structure. This analysis was
inspired by the works of Baumol, Fischer, and Braunstein,
who have investigated cost and revenue functions from the
perspective of firms producing heterogeneous goods. This
perspective emphasizes a multi-dimensional analysis of the
revenue and production costs of every combination of
produced goods, by looking at revenue and cost behavior
along and between many "output rays" in "production space."
The key point is that the production of certain combinations
of industrial goods may be characterized by
complementarities of production and/or complementarities in
consumption. If there are such complementarities, then the
industry's production costs will decrease and/or the
revenues will increase, respectively, with multi-good
production. Thus, the industry will find it more profitable
to produce particular combinations of several kinds of
goods. Conversely, if a combination of goods is subject to
substitution in production and/or consumption, then the
industry will be characterized by firms each producing a
specialized and unique product.
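
One common way to write these two cases down is the economies-of-scope condition from the multiproduct cost literature. The sketch below is illustrative only: the symbols C, R, y_1 and y_2 do not appear in the original text and are assumed here, with y_1 and y_2 standing for the output of two types of data bases.

```latex
% Assumed notation: C(y_1, y_2) is the cost, and R(y_1, y_2) the revenue,
% of a single firm producing quantities y_1 and y_2 of two types of data bases.

% Complementarity in production: joint production is cheaper.
C(y_1, y_2) < C(y_1, 0) + C(0, y_2)

% Complementarity in consumption: joint supply earns more revenue.
R(y_1, y_2) > R(y_1, 0) + R(0, y_2)

% Substitution in production: specialized firms are cheaper.
C(y_1, y_2) > C(y_1, 0) + C(0, y_2)
```

Under the first two inequalities a multi-product publisher is favored; under the third, separate specialized publishers are.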

If it is possible to demonstrate that there are
complementarities or substitutions in production and/or
consumption of certain goods (here, types of data bases),
then one could demonstrate that there may be "natural"
limits and "ranges of production" for firms in an industry.
This would be counter to historical economic wisdom, which
has often labelled an industry with a few large producers
and many small ones as "oligopolistic" and hence
inefficient. Does the publishing activity of the sample
publishers indicate a data base industry oligopoly, or a
reflection of natural consumption and production forces?

This author, much to his regret, has no well-formulated
answer to the above question.

There is great appeal in hypotheses focusing on the
consumption side, and how this could largely determine a
firm's size of production. Chemical Abstracts' data bases
most likely address much different user needs than do
Cordura's or Sadtler Research Labs'. The six Chemical
Abstracts data bases in the sample are bibliographic,
whereas Cordura's seven and Sadtler's two are all
non-bibliographic. The temptation is to assume that
Chemical Abstracts data bases address users who would have
"complementarities" in consumption. This would indicate
that Chemical Abstracts should produce more data bases and
types of data bases than the other two. This distinction,
however, does little to illustrate why both Chemical
Abstracts and Cordura are "large" (H and 26 data bases,
respectively), and Sadtler is "small" (3 data bases). A
proper evaluation of this issue would involve investigating
all data bases produced by each of the twenty-six
publishers. However, on the basis of the data bases
included in the sample, and at this level of analysis, this
does not appear a fruitful avenue of further research.

Looking to the production side for possible
complementarity or substitution of production, there are some
a priori reasons to expect either substitutions or
complementarities in the production of many types of data
bases. The least controversial reason to expect
complementarity in production is that producing a data base
involves extensive organization and the writing of general
software, each of which is a major front-end investment.
Once made, these investments can be applied to the
production of other data bases. Furthermore, if a firm owns
its own computing facilities, then it is usually in the
company's interest to spread the high fixed costs of this
equipment over as much work as possible. Validating this
would require examining the production processes and
facilities of each data base publisher.
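
The fixed-cost argument above can be illustrated with a small numerical sketch. Everything in it is assumed for illustration: the function names, the shared front-end cost of 100, and the per-data-base cost of 20 are hypothetical and are not estimates for any publisher in the sample.

```python
def joint_cost(n_databases, shared_fixed=100.0, per_database=20.0):
    """Hypothetical cost for one firm producing n data bases.

    The front-end investment (organization, general software,
    computing facilities) is paid once and shared by all data bases.
    """
    if n_databases == 0:
        return 0.0
    return shared_fixed + per_database * n_databases


def separate_cost(n_databases, shared_fixed=100.0, per_database=20.0):
    """Total cost if each data base came from its own single-product firm,
    so that the front-end investment is repeated n times."""
    return n_databases * joint_cost(1, shared_fixed, per_database)


def average_cost(n_databases, shared_fixed=100.0, per_database=20.0):
    """Cost per data base for the multi-product firm."""
    return joint_cost(n_databases, shared_fixed, per_database) / n_databases


if __name__ == "__main__":
    for n in (1, 2, 4, 8):
        print(n, joint_cost(n), separate_cost(n), average_cost(n))
```

With these assumed figures the cost per data base falls as the shared investment is spread over more data bases, which is the pattern the paragraph describes. Whether actual publishers behave this way is the empirical question left open in the text.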
On the other hand, it could be possible that producing
certain types of data bases involves exceptionally high
production costs for a very specialized product, thus
creating the potential for substitutions in production. For
example, suppose there are extremely high fixed costs in
developing data files and software to store and access
chemical information by molecular formula. This could be an
example where it is economic for a firm to produce only one
or a few data bases, since the technology involved in these
data bases may not easily be applied to other data bases.
Yet, such data file and software technology would probably
find success in separate data bases covering topics such as
physical chemistry, organic chemistry, biochemistry, general
medicine, pharmacology, toxicology, materials science, etc.
Perhaps a better example would be to determine if Sadtler
Research Labs has such a specialized expertise, and such
specialized technological needs for its infrared spectral
information data bases, that there would be diseconomies for
Sadtler to produce other types of data bases.

An empirical study of cost functions would be of use in
investigating this question.

There is an empirical study of the journal publishing
industry which is interesting to cite in relation to this
discussion. Baumol and Braunstein have performed an
empirical study of scale economies and production
complementarities for a sample of nonprofit publishers of
scientific journals. Baumol and Fischer estimated a
variety of cost functions, and found that about two-thirds
of the firms were close to their "minimum cost locus," so
that neither mergers nor splitting these firms would reduce
their costs. Furthermore, up to the "minimum cost locus,"
or "point of minimum ray average cost," costs per journal
declined with the number of journals per publisher. The
remaining one-third of the publishers was said to be in the
region of "sub-additivity," such that limited amalgamation
might reduce the firm's cost. As the journal publishing
industry is more established than the relatively new machine
readable data base industry, it would be interesting to see
how close machine readable data base publishers are to the
points of "minimum ray average cost."
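
The two terms quoted above can be made concrete with a short sketch. Ray average cost is taken here as the cost of a scaled output bundle divided by the scale factor, and sub-additivity as the condition that one merged firm produces two firms' combined output more cheaply than the two firms do apart. The cost function and all of its parameters are hypothetical, chosen only so that the curve is U-shaped; they are not the functions estimated in the cited study.

```python
def cost(bundle, fixed=100.0, unit=10.0, congestion=1.0):
    """Hypothetical multiproduct cost function (an assumption for illustration).

    bundle holds the quantity of each type of data base produced.
    """
    total = sum(bundle)
    if total == 0:
        return 0.0
    return fixed + unit * total + congestion * total ** 2


def ray_average_cost(direction, scale):
    """Cost of the bundle scale * direction, divided by scale."""
    scaled = [scale * q for q in direction]
    return cost(scaled) / scale


def min_rac_scale(direction, scales):
    """Scale along the given output ray with the lowest ray average cost."""
    return min(scales, key=lambda t: ray_average_cost(direction, t))


def merger_saves(a, b):
    """True if one firm producing a + b is cheaper than two separate firms
    (the cost function is sub-additive at this pair of bundles)."""
    merged = [x + y for x, y in zip(a, b)]
    return cost(merged) < cost(a) + cost(b)


if __name__ == "__main__":
    ray = [1, 1]  # equal amounts of two types of data bases
    best = min_rac_scale(ray, range(1, 21))
    print("scale with minimum ray average cost:", best)
    print("small firms gain by merging:", merger_saves([1, 1], [1, 1]))
    print("large firms gain by merging:", merger_saves([5, 5], [5, 5]))
```

Under these assumed numbers, firms well below the minimum point gain from amalgamation while firms already at it do not, which mirrors the distinction drawn in the paragraph between the two groups of publishers.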


Baumol, W. and Braunstein, Y.: "Empirical Study of Scale
Economies and Production Complementarity: The Case of
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